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Abstract 
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1 Introduction 

The largest clique problem (LCP), is the problem to find the largest complete subgraph 
of a given graph G. Let G = (V, E) be a graph. A graph g is a subgraph of G, g C G, if 
its vertex set V(g) C V and its edges E(g) C E. A subgraph g = (V(g), E(g)) is complete 
if for any i,j £ V(g) then 6 E(g). We will denote by /C(G) the set of complete 

subgraphs or cliques of G and by M.axCl{G) the set of the largest cliques in G: 



MaxCl{G) := {g € K{G) : \V{g)\ = max |y(</)| (1) 

g'eK(G) 
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where \B\ denotes the cardinality of the set B. We call clique number of the graph G, 
u(G), the cardinality of the vertex set of any largest clique in G, i.e., oj(G) = \ V{g)\ with 
g € MaxCl{G). Solving the LCP for a given graph G(V,E) implies finding u(G), and 
both problems are in fact in the same complexity class. Note that we are not strictly 
following the definition in pQ since we are using the term clique also for a non maximal 
complete subgraph of G. The LCP is one of the main example of ./VP-hard problem. It 
has been proven (see e.g. [GJ] and references therein) to be polinomially equivalent to the 
^-satisfiability problem and it is equivalent to many other well known difficult problems 
in combinatorial optimization. 

It is well known that the LCP remains difficult also when restricted to typical instances 
of Erdos random graphs with finite fixed density p, i.e. of graphs with n vertices, \V\ = n, 
in which each pair £VxV belongs to the edges set E with independent probability 
p. In particular it is well known that in such a random graph G it is very easy to find 
complete subgraphs g G K,(G) of size \g\ = — but is difficult to find cliques that exceed 
this size, see below. The clique number w(G) is almost deterministic, in a sense that will 
be stated more precisely below, and it is roughly speaking twice the size of the cliques that 
are easy to find. Recently some progress has been made in [8j to understand the intricated 
landscape of the LCP for Erdos random graph, and therefore to show why at the moment 
there are no algorithms able to find cliques of a size that exceeds the easy one. 

In a previous paper [6] , in collaboration with Antonio Iovanella, two of us introduced 
an algorithm to find cliques inspired by the cavity method developed in the study of spin 
glasses. This Markov Chain Monte Carlo exhibits very good numerical performances, in 
the sense that, although asymptotically it is not able to find cliques larger than the easy 
ones, for finite size effects it find cliques very near to the largest also for quite large graphs. 
The idea of the algorithm is the following: starting from a non feasible (i.e. non clique) 
configuration a of k vertices of V, the algorithm chooses the next configuration assigning 
to each new set a' of k vertices of V a probability proportional to e~^ H °( a,a )+ h ( k -q( (7 > a ))] ) 
where f3 is a parameter called inverse temperature. The function Hq is a non negative 
quantity defined by the number of missing edges between the two configurations, i.e. the 
number of pairs with i £ a, j £ a' and % 7^ j such that (i,j) E{G). The quantity 
q{a, a') represents the overlap between a and a' and then k — q(a, a') is the number of 
vertices in a 1 that are not in a. The transition probabilities depend therefore also on the 
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positive parameter h. 

The presence of /3Hq in the transition probabilities, when j3 is large, makes very low 
the probability to reach new configurations a' that are badly connected with a, while a 
large h depresses the configurations a' with many different vertices with respect to a. 

From a statistical mechanics point of view the dynamics above has various interesting 
features. First, the dynamics is conservative, since it is defined on the space of configura- 
tions with k vertices, moreover since the whole configuration can be renewed in a single 
step the resulting MCMC can be considered a canonical (or conservative) probabilistic 
cellular automaton (PC A). Rigorous results on canonical PC A are quite rare in the liter- 
ature. Second, H(a, a') = Hq + h(k — q) is in some sense the Hamiltonian of a disordered 
system of pair of configurations, and the combined action of Hq and h(k — q) makes the en- 
ergy landscape quite complicate. Third, good numerical performances stimulate a deeper 
understanding of the dynamics. 

For this reasons we decided to study in more detail the statistical mechanical system 
described by the chain in the case of random graphs. We prove several results. First of 
all it can be proved rigorously that for suitable values of k, including the interesting case 
k = oj(G), the annealed analysis corresponds to the quenched one. 

Then it can be proved the existence in the plane (j3, h) of a nontrivial phase diagram. 
More precisely, the system exhibits a first order phase transition while the pair j3, h crosses 
a line h c ((3). At h > h c (/3) the phase is characterized by pairs of configurations a, a' with 
a = a' and with a given density of missing links in a, depending on (3. At h < h c (/3) the 
phase is characterized by pairs of disjoint configurations a, a' with again a particular value 
for the density of missing links between a and a' depending on /?. 

Moreover, in the region below the critical line h c ((3), a second phase transition is 
present, and again it has a transparent "physical" interpretation: for temperature T = 
jjj below a critical value T c the system tend to oscillate indefinitely between two fixed 
configurations a and a', while above T the new configuration at each step is typically 
different from the configurations previously visited by the system. 

This detailed control on the features of the system is achieved by a careful evaluation 
of the thermodynamics. In particular the proof of the existence of the phase transitions 
can be performed in a relatively easy way, computing the annealed partition function of 
the system. The self averaging of the system, i.e., the equivalence between quenched and 



4 



annealed, is more complicate to prove rigorously, involving the computation of the second 
moment of the partition function, and it is more a brute-force computation. We will 
present it in some detail in the paper, in an almost pedagogical way, because, as far as 
we know, there are few cases in the literature where a phase transition for a disordered 
system can be controlled rigorously. Moreover the way we achieve this result, although 
based on classical argument like the saddle point method, has some technical details that 
are quite interesting and may be useful also in different contexts. 

Of course this analysis gives important information on the choice of the parameter 
used in simulation, and in a following paper we will discuss its application to the study of 
the convergence to equilibrium of the dynamics. 

To be more precise we need now some definition. 

I. 1 Random graphs and the clique number 

In this section we fix definitions and notations on random graphs and we recall well known 
results on the clique number. 

For all p £ [0, 1] consider the probability space given by an infinite sequence of indepen- 
dent Bernoulli variables of parameter p, i.e., oj € U := {0, 1} N , oj = (oi, 02, a/, ...) with 
ai 6 {0, 1}, with cr-algebra generated by A\ := {uj : ai = j}, j = 0, 1 and with probability 
measure 

F(u : a h = jx, ...a ik = j k ) = p^.-.p^ with V\ = P, Po = 1 - P- 

Given a set of vertices V = {1, n} we associate to it the probability space £l n given by 
the first (2) Bernoulli variables in describing the edges between vertices in V, with the 
obviuos ordering (1, 2), (1, 3), (2, 3), (1, n), (2, re), (re— 1, n). In this way we represent 
with the probability space usually denoted by £/(N,p), i.e., the infinite random graph. 

For any G € <7(N,p) and re S N we denote by G n the subgraph of G spanned by 
the set V n := {1,2, ...,n}, i.e., the subgraph of G containing all the edges of G that join 
two vertices in V n . By definition G n is Q n measurable. We will denote by P and E the 
probability and the mean value respectively, on this probability space. 

The following well known result on the clique number can be found in [1] (Corollary 

II. 2, pg 286): 
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Proposition 1.1 For a.e. G G £7(N,p) f/iere is a constant mo(G) such that ifn> mo(G) 
then 



w(G n ) - 2 log b n + 2 log fe log b n - 2 log 6 (-) - 1 



3 

<2 



with b := -. 

v 



The main tool in the proof of this Proposition is the study of the random variable Y r (n) 
defined as the number of complete subgraphs of G n with r vertices, i.e., the number of 
r-cliques in G n with the second moment method . Indeed its mean value is given by: 



EY r (n) = rjp® =: /(r,n) ~ b rXo ^ n ~4 



with b := -. The function f(r,n), as a function of r, has its maximum in r ~ log b n and 
drops rather suddenly below 1, by increasing r, say at r ~ 21og fe n. Moreover, again by an 
explicit calculation, Y r (n) satisfies the following inequality 

varY r (n) , A 9 i 

wm?- brn +2{EYr) ' 

when (1 + rf)\og b n < r < 31og b n, for r] E (0,1). With the Borel-Cantelli lemma, it is 
easy to show that, given e G (0, 5), for almost every graph G G ^(N,p) there is a constant 
mo(G) such that if n > mo(G) and n', < n < n r+ i then uj(G n ) = r, with 

n r := max{n G N : f(r,n) < r ~( 1+£ ^} n' r := min{n G N : f(r,n) > r 1+£ }. (2) 

Indeed the size k of the interesting cliques can be parametrized by a real c G (1,2] since 
the relation between n and the size k of the cliques that we want to study is given by 

, In 1 /p 

\nn = k — , with cG (1,2 . 

c 

As emerges in ([2]) it's more efficient to use k as parameter, instead of n, to study the 
asymptotic behavior for large graphs and so for any c > 1 we define 

og := {(nfcjfcx) : hm — — = — = — } (3) 

fc->oo K C 

This means that if we define = k we cons ider sequences nt such that limfc_ s . 0O = c. 
This is actually a particular asymptotic regime that could be generalized. 
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Let Y be a random function on the probability space Q, associating to a pair (n, k) a 
random variable Y(n, k) on S7 ra depending on k, for instance the number of k cliques in 
G n , considered before. 



Definition 1.2 A random function Y on the probability space SI is called c- asymptotically 
self averaging, if the random variables ^(nfl) conver 9 e almost surely to 1 for any (nk)k>o 
in S 5 as k — ^ oo, uniformly in S e - 

This means that there exists f2 C f2 with P(f2) = 1 such that for every uj £ 0, and any 
(nk)k>o S 5g the random variable ^y^A^ converges to 1. 

Note that, by the Borel-Cantelli lemma a sufficient condition for self-averaging is the 
following: 



varY(rik, k) 



< n- a e°^ (4) 



(El> fc ,fc)) 2 

for some a > 1 with o(k) uniform in (n^) G 5c- Indeed for any e > we have that 
l E y^ fc ) - !| > e for some n = n fc> ("k)fc>o e 5 5 J < 

e 2 v Ey(n fc ,/c) 7 - e 2 

is summable on fe, since (nfc)fc>o G 5 g implies that c& := fc converges to c, so that with 
at most finitely many exception on k, we have that I EY^n^k) ~M < £ f° r anv ( n k)k>o £ <5c- 
We also note that if Z is c-asymptotically self averaging we have that In Z(rik,k) — 
lnEZ(rik, k) converges almost surely to 0. 

1.2 The cavity algorithm 

Let V = {1, ...,n} and define for each unordered pair in V x V 

(0 i£(i,j)€E 
Jij = \ (5) 

We consider the space X^ := {0, lji 1 '---'™} of lattice gas configurations on V and we 
will denote by the same letter a configuration a £ X^ and its support a C V . On this 
configuration space X we can consider an Ising Hamiltonian with an antiferromagnetic 
interaction between non-neighbor sites: 
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H(a) := ^2 JijVicrj - h^ai (6) 

where h > 0. It is immediate to prove that when h < 2 the minimal value of H(a) 
is obtained on configurations with support on the vertices of a maximum clique. In 
the case of a random graph G, i.e., when the interaction variables Jij are i.i.d.r.v., the 
Hamiltonian ([6]) is similar to the Hamiltonian of the Sherrington-Kirkpatrick(SK) model. 
The main differences are that we use lattice gas variables instead of spin variables and, 
more important, the interaction is given by Bernoulli variables. 

For each a G X^ n > we define its cavity field (or molecular field) as the field created in 
each site i by all the sites in the configuration a: 

h l (a) = ^2j ij a j + h(l-a i ) Vi G V. (7) 

We consider the canonical case, i.e., for any integer k < n we define the canonical 
configuration space 

4 n) :={aeXW : J>i = fc} (8) 
For each pairs of configurations a, a' G xf? 1 we can define the pair hamiltonian: 

H(a, a') = ^2 JijViv'j + h ~~ a ^ a 'i = z2 h i( a ) a i ( 9 ) 

This hamiltonian is non-negative and vanishes when a = a' and its support is a fc-clique. 
For every a, a' G xjf 1 ^ the transition probabilities of the cavity algorithm are given by: 

with 

Z a = £ e-? H ^\ (11) 
rex k 

By an immediate computation we can check that the detailed balance condition w.r.t. 

the invariant measure on X^ 



= ^ fc -Z^7^ = ^ (12) 



is verified with partition function Z: 

Z(n,k) = Z(4 n) )= Z °= E e-W'rt. (13) 

We will denote by //(.) the mean w.r.t. this stationary measure. For large /3, this stationary 
measure is exponentially concentrated on cliques. 

Note that at each step all the sites are updated; this dynamics could be considered 
a canonical version of probabilistic cellular automata (PCA). Given a fixed configuration 
a the probability measure on x[ given by 7r<j(.) := P(cr, .) can be considered in the 
frame of the Fermi statistics. Indeed the cavity fields hi(a) have values ei <r = I + rh with 
/ G {0, 1, A;} and r G {0, 1}. We define 

l ll :={ieV:h i (a) = l + h} l = 0,...,k. (14) 

By equation @ we have 

H((r, t) = E M ") 7 "* = E e/ . r E ^ =: E "V^r ( 15 ) 

i i,r idXi r l,r 

where i%i r denotes the occupation of the level (or cell) (l,r). On the other hand each 
level consists of g^ r := \Ii r \ subcells (or sublevels) containing at most one particle since 
for every i G X; >r we have n G {0, 1}. This means that instead of configurations in X^ 1 
we can consider the occupation numbers of the levels {wi r }l=o... fcr=oi- This statistical 
system is called a Fermi gas, see [3] for more detail on sampling for the Fermi statistics, 
and thus on the realization of this single step of the dynamics. 

As far as the energy levels corresponding to sites not in a, i.e., with r = 1, are 
concerned, we have that their number of sublevels, = \{i G V : hi(a) = I + h}\, is 
"almost deterministic", as discussed in [6J. Indeed they follow a binomial law, and precise 
results can be found in Lemma 15.41 in Section [5j 

A final remark on probability measures can be useful. The invariant measure yu(<r) is 
not a Gibbs measure, as usual with PCA, but we can define a Gibbs measure on pairs 
of configurations, i.e., on X^ x X^ as /^(cjO 7 ) = -^e~P H ( a,a ) with the same partition 
function Z(n,k) given in f)13|) . Actually the invariant measure \i can be considered the 
marginal of [ii. The probability measure 7r CT (.) = e introduced above in the discus- 
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(n) 

sion on the Fermi statistics can be considered as the conditioned measure on , since 
we have the relation: 

/x 2 (o-,t) = h{o-)k<j{t). (16) 

1.3 Results 

The main results presented in this paper are summarized by the following: 
Theorem 1.3 For each p <E (0, 1) and j3 <E (0, oo] 

i) let n and k be integers such that c := k > I, defining h := |, there is a critical 



value of h defined by 



k= i(im- m+ mi) (17) 



with f((3) := — In [p + (1 — p)e @] for which 



k 2HiM + k _ fm H!^ll -k\nk + o(k) ifh>h, 

ln(EZ(Aff j )) 



(™)\\ _ J — c r * ~ ■/ 2" 

2A , 2 in(^£) + 2fc - /J^fc 2 - /(/3)A; 2 -2klnk + o(k) ifh<h c 

(18) 



«; if c€ (1,2] 



^z(x^) < n _ 2eQ(fc) 

(EZ(^)) 2 " 



with o(k) independent of n and c, so that the partition function Z is c- asymptotically 
self averaging for c € (1, 2]; 

Hi) the line h c corresponds to a first order phase transition, in particular the phase 
with h > h c is characterized by configurations a with H(a,a) ~ k 2 f'(2(3) and the 
phase with h < h c is characterized by pairs of disjoint configurations (a, a') with 
H(a,a') ~ k 2 f'(P) + k 2 h; 

iv) if c £ (1,2] and for any {nk)k>o £ $c and h / h z define the entropy S(n^,k) := 
— v (n k ) M ") ln(^(cr)), then T^S(nk,k) converges almost surely to the following 
non random function: 

( lni/p _ /(2/3) _|_ '{2(3) in the parameter region (A) : h > h 5 

2 lnl/p _ + pj,^ . n ^ . ~ h< ~ h _ p > p_ ^ 

ln y p in the region (C) : h < h e and (3 < /3 S 

10 



4. 




where f3 5 is a zero of the function 

c(/3) = lnl/p_ /(/3)+/?//(/3) (21) 

so that also S(rik,k) is c- asymptotically self-averaging. The function s is discontin- 
uous along the line h = h s and has a discontinuity in its first derivative at T 5 = -hz 
corresponding to a "low temperature phase transition". The asymptotic value of the 

~ (n ) k 2 lnl//p 

entropy in the phase h < h s and (3 < /3g is maximal since \Xu | x e Cfc . 
The phase diagram is summarized in Figure 1. 
Remark 1.4 We can write 

Z = J2 e~P E N E (22) 

E 

with E running on all the possible values of the energy H(a, r) and Ne being the number 
of pairs of configurations o~,t with H(a,r) = E. For (3 = oo this implies that Z = Nq. 
Hence the self- averaging of Z implies the self-averaging of the number of cliques of any 
size k corresponding to c £ (1,2]. This generalizes the Bollobas result quoted above. 

Remark 1.5 Even though the relevant case for the clique problem is c G (1,2], for c > 2 
we can prove (see Appendix [0^1 that, with (3 C < oo the unique solution of 

/(2&)-i/(48 c ) = i]nl/p, 

for all (3 < f3 c we still have the estimate \19\) . Therefore quenched quantities behave like 
annealed ones for (3 < (3 C . In addition we can prove the existence of a second value for the 
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inverse temperature, say j3 c > f3 c such that for f3 > f3 c and (3 > 2(3 C , respectively for h > h c 
and h < h c , quenched quantities certainly differ from annealed ones. Indeed if j3 c > f3 c is 
such that 

/(2/3 c )-2&/'(4&) = -hil/p, 

c 

the estimated entropy for ^2 obtained from the annealed quantities turns out to be asymp- 
totically negative, i.e., for (nk)k>o in $c, 

for (3 > 0c in the case h > h 5 and j3 > 2/3 s in the case h < h s . Then, for j3 > f3 e and 
f3 > 2/3 g respectively, quenched quantities certainly differ from annealed ones. We actually 
expect a third phase transition at these temperatures: conversely, for f3 < /3 E and f3 < 2(3 5 
respectively, quenched quantities should behave like quenched ones. 

Notation: 

For notation convenience in what follows we adopt the following simplification: given c > 1 
and (nfc)fc>o £ S s we write n = n k and c = c& 



2 The annealed partition function Z 



In the case of random graphs is not difficult to compute the annealed partition function: 

Z:=EZ = E\ i ^ e~ ma ' T) ■ (23) 

Let I := a (It, and q be the overlap q := \I\. We denote by Ho(a,r) the first part of 
the pair hamiltonian, i.e., the pair hamiltonian evaluated for h = 0: 

H (a,T) = ^ JijO-iO-j = ^ JijiViTj + O-jTi) (24) 

i,jeV,i^j {i,j} 

The quantity CjTj + o~jTi takes values 0,1,2 as given in Table 1, where we denote by 
S := a\I, T := t\I and C := (o"Ur) c . So <7jTj + OjTi = 1 for {i,j} in the set of unordered 
pairs Si = (S x I) U (T x /) U (S x T) and a^j + ajn = 2 for {i , j} e £ 2 = I x I. By using 
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Table 1: Values of <7jTj + ajTi 



the independence of the random variables J« we can conclude: 



Z = J2 E e~^ k -^ H Ee-" J « J] Ee~ 2 ^. 

9=0 a,TEX k : \I\=g {ij}e£i {i,j}eS 2 



Since Ee~P J v = e"^ and Ee~ 2 ^ = e~^ 2 ^ with = - In [p + (1 - p)e~' 3 )] and 

defining 

&(q) := In 



n \ /2fc — 



2(* - q) 
k — q 



we have 



k k 
Z = ^ e e(q) e -ph(k-q)~f((3)(2q(k- q )Mk~<l) 2 )-fm !1 ^ _ ^ p 6i ,/l-<I»( 7 i 

g=0 9=0 



(25) 



£ e »W+*W (26) 



with 



<%) := -f3h(k - q) - f((3)(k 2 - q 2 ) - /(2/3) 



(27) 



We collect in Appendix [A] the main properties of the function f(/3). 

As far as the entropic term O is concerned we can use the Stirling formula n! = 
(^"n^^e^i^) to obtain the following asymptotic behaviors for m and k large with 
m = o(n) and g < k: 



ft \ in Tft 

] = expjmlnn — mlnm + m + o(m)\, ln(n — m) = Inn \- Oi — — ) (28) 

m/ n n z 



expjfcln k — (k — g) \n{k — g) — glng + o(k)}. 



(29) 



With the definition Inn = fc ln ^^ p ^ we can write: 



Q(q) = (2k - g)*; 11 ^ 1 /?) + 2 & _ g _ glng _ 2 (jfe - g) ln(fc - 9 ) + o(jfe) 
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We can estimate Z for large k with the saddle point method looking for the maximum 
of the function 0(g) + <3?(g). Indeed InZ = m&x qe ^ k ](Q(q) + $(?)) + OQnk) We have 
0(g) + <&(g) = a(q) — b(q), where a(q) is a polynomial with degree less or equal 2, i.e., with 

a(q) = (2k - c^^lM + 2 k-q- (3h(k - q) - f(f3)(k 2 - q 2 ) - /( 2/ 9)^f^, 



and 6(g) the remaining part: 

b(q) = glng + 2(k - q) hi(k - q) + o(k) 

By noting that /(/3) is a concave function so that f{/3) — % > we have that a(q) 
is a convex parabola, and so with maximum in or k, while b(q) is non negative and 
I b(q) | < 4klnk. We have that, for sufficiently large k, the maximum of a(q) is obtained in 
qmax = k if h > /i c , (see equation (|17p for the definition of h c ) and in q ma x = if ft < ft c . 
By simple calculations we have that these points k and correspond to the maximum also 
for the function 0(g) + $(g) = a(q) — 6(g) when h > h c and /i < /i c , respectively. Indeed in 
the two different cases it is immediate to verify that in a neighborhood of q ma x, i-e., in the 
intervals [k — k a ,k] and [0, with a G (0,1), respectively, the following estimates hold 
for the variations of the functions a and 6: for sufficiently large k, there exists a positive 
e such that 

|Aa(g)| := |a(g + 1) - a{q)\ > ek, |A6(g)| = o(k) 
and for g outside these intervals a(q) < a(q max ) — ek 1+a , so that o(g) — 6(g) < a(g) < 

Q'i.qmax) ^ ^(?macc) b(c[ max ). 

Summarizing we have, for h > h c : 

In z = k 2 + k-f(2(3) k{k ~ 1] -k\nk + o(k) (30) 

c 2 

and for h < h c 

InZ = 2fe 2 ^-^ + 2A; - /3/ifc 2 - /(/3)A; 2 - 2A:lnA: + o(Jfe) (31) 
c 

For large we obtain that p- In Z is a continuous function with a discontinuous derivative 
in /3 when h = h c , corresponding to a line of a first order phase transition as discussed in 
Section HI 
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Table 2: The value of OiTj + <x,Tj + c^rj + u'^t[ for different i,j 

3 The asymptotic self-averaging of Z 

The proof of self averaging of Z is a crude calculation based on elementary arguments. We 
first evaluate the second moment of Z proving that asymptotically it behaves like Z 2 . An 
upper bound for ^0- is obtained with a more detailed computation based on the same 
tools. 

3.1 The second moment of Z 

We evaluate the second moment of Z: 

E(Z 2 ) = E( e-WW+KV f)^ (32) 

By defining, as before, q = \a D t| (and similarly q') we have 

H(a, t) + H(a', t') = J tJ {a tTj + a jTi + aft + <t#) - h(2k -q-q') (33) 

{id} 

The quantity a^j + ajTi + cr^Tj + a^T- takes values 0, 1,2,3,4 as in the Table 2, where 
we use the previous notation, i.e., I := a (It, S := a\I, T := t\I, C := (a U r) c and 
similarly for the sets I',S', T' and C We also use the notation SS' for the set S D S' and 
so on. The table is symmetric due to the symmetry in the exchange i o j so we write only 
the upper triangle. For every I £ {1,2,3,4} again we denote by E\ the set of unordered 
pairs {i,j} where a^j + ajTi + c^j + a'M = I. By the table we have: £4 = II' x II', 
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Table 3: Index of the intersections 

£ 3 = (ST x tI') U {IS' x It') U {II' x IT') U (//' x TI'), and so on. With these notations 
we can write for the second moment of Z: 

nz 2 ) = z ZzZ e ~ phi2k ' q ' q,) E II ^"^x 

9=0 9'=0 o-,r: \<TCtr\=q, {i,j}e£i 

<t',t': \a'C\T'\=q' 



{i,j}££'2 {i,j}e£:i {i,j}&£4 



(34) 



For shortness we will denote by g r the cardinality of the intersection of the different subsets 
involved in this table, where the index r £ {1, 2, 9} is fixed in Table 3, e.g g\ := \SS'\. 
The cardinalities g r have the following constraints: 



9i + 92 + 53 < k - q, 94 + 95 + 96 < Q, 97 + 9s + 99 < k - q, 



(35) 



91 + 94 + 91 < k - q', 92 + 95+ 9s < q', 93 + 96 + 99 < k - q'. (36) 
The cardinality of the sets given by intersections with C or C' is obtained by difference: 

gs ~\SC'\=k-q-( gi +g 2 +g 3 ), g s > := \S'C\ = k - q' - { 9l + 54 + g 7 ), (37) 

gi ■ = \IC'\ =g- (54 + 55 +56), 9l> ■= \l'C\ = q - (52 + 55 + 5s), (38) 
5t := \TC\ =k- q- (57 +58 +59), 5T' := \T'C\ = k - q' - (g 3 + g 6 + g 9 ), (39) 
By defining 5 = 51 + 52 + ••• + 59 and M(gi, ...,gg,n,q, q') the multinomial coefficient 



M(gi, ...,g 9 ,n,q,q) 



9i } --99^9s^9l } -9T } -9s' } -9l' } -9T'K n ~ (^k - q - q' - g))\ 



16 



we can write 

k k (2fc~g)A(2fc-g') 

E (^ 2 ) = EE E E M( 5l ,..., 59 ,n, 9 , g Oe $(9)+ * (9 ' )+ * (M '' 9 ' 9l> -' S9) (40) 

<j=0g'=0 9=0 91,— ,99 



where the sum over g\,..., gg satisfies the constraints (|35|) . (|36j) and g\ + g 2 + ••• + 59 = 9 
and with $ defined in (|27p and ^ given by: 



55(55 - 1) 



2/(2/3) -/(4/3))+ J 



2 

r=l 



where the coefficients C r are defined as follows: 



Ci = 2/03) - /(2/3) ( 55 + 56 + 58 + 59) (41) 



C 3 = (2/03) - /(2/3)J ( 54 + 55 + 57 + 5s) (42) 

C 7 = (2/09) - /(2/3)) (52 + 53 + 55 + 5e) (43) 

C 9 = (2/(/3) - /(2/3)) (51 + 52 + 54 + 55) (44) 

C 2 = (2/09) - /(2/3)) (54 + 56 + 57 + 59) + (/(/3) + 7(2/3) - /(3/3)) (55 + 5s) (45) 

C 4 = (2/09) - 7(2/3)) (52 + 53 + 58 + 59) + (/OS) + 7(2/3) - /(3/3)) (55 + 5e) (46) 

C 6 = (2/(/3) - 7(2/3)) (51 + 52 + 57 + 5s) + (/(/3) + 7(2/3) - /(3/3)) {g 4 + 55) (47) 

C 8 = (2/09) - 7(2/3)) (51 + 53 + 54 + 56) + (/OS) + 7(2/3) - /(3/3)) (52 + 55) (48) 

C 5 = (2/09) - 7(2/3)) (51 + 53 + 57 + 59) + (/(/3) + 7(2/3) - 7(3/3)) (52 + 54 + 56 + 5s) (49) 

We denote by V the region of parameters q,q',g,gi, ---^9 defined by the constraints 
< q < k, < q' < k, g = g x + ... + g 9 and (J35J and flggj) . 

Lemma 3.1 For any /3 6 (0, 00) and 7 or an 2/ 9; (/> 5i> ••-> 59> 5 £ ^ anc ^ / or c < 2 we have 

■$(q,q', gi ,...,gg,g) < ^>{q,q',g,g 5 ) (50) 

w/iere 

* = 95(95 ~ 1} (27(2/3) - 7(4/3)) + i (/(/3) + 7(2/3) - 7(3/3)) ((fc A g) + g 5 )(g - g 5 ) (51) 
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The proof of Lemma 13. II is given in Appendix ICl 

As far as the entropic term is concerned we can write 



< 

^ gi\...gg\gslgilgT 1 -gs ,1 -gi' 1 -9T'K'n — (4k — q — q' — g))l ~ 

31,92,33,34,96,37,98,39 yi yy y ° yJ yl ^ yJ yl y y H H ij " 

< exp{(4k-q-q'-g)lnn-ln ({q-g)\(^-g)\\-2la ((&-<?- g)\{k-q' - g)l\ +8} =: e 02 

(52) 

where the sum is under the condition g\ + g2 + gj, + gA + g& + 97 + gs + 99 = 9 — 95 and 
with the notation ml = 1 if m < 1 and where, for the sum of g r with r / 5, we used the 
estimate 

> = < e 8 . 

31,32,33,34,36,37,38,39 a a a S " S a ^ Vy y ; 

With these estimates we can write 

k k (2k-q)A(2k-q') gAqAq' 

E(Z 2 ) <^2^2 ^2 ^2 e® 2 ( q ' q ' ' 9 ' 9 ^ + ®^ + ® ( - q ' )+ ^ ( - q ' q ' ' 9 ' 95 ^ (53) 

q=0 q'=0 9=0 95=0 

To evaluate this sums again we look for the maximum of the exponent. Define for notation 
convenience <&2{q,q') = + 

Lemma 3.2 The maximum of the function 02 + $2 + ^ on the parameter region defined 
by the constraints is obtained for q = q' i.e., for q = q',g,gs in the three dimensional 
polyhedron V defined by 

0<q<k, g 5 <g<2k-q, < g 5 < q (54) 

and represented in Figure 2. Moreover (02 + $2 + ^)(q,Q, g, 55) reaches its maximum on 
V in (k, k, 0, 0) if h > h c and in (0, 0, 0, 0) if h < h c . 

The proof of Lemma 13,21 is given again in Appendix [Cj Note that when g = and 
q = q' we have the expected relations ©2 = 20 + O(^), and ^ = 0. 
With these lemmas we immediately obtain 



lnE(Z 2 ) = max_(0 2 + $2 + *)(g,g,5,55) + C , (lnA:) = 2 InEZ + 0(ln £;) (55) 

9,9,95€:P 
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Figure 2: The polyhedron V 

3.2 Self averaging 

To evaluate the quantity we can write 

k k (2fe-g)A(2fe-g') , *(,)+*(,') 

^^C ^ g ^ g9 gil-~99l9sW9Tlgs'l9l'l9T'Kn- {±k - q - q 1 - g))\ 

and note that, as in the case of the clique number, the terms corresponding to g = and 
g = 1 are identical in E(Z 2 ) and in (EZ) 2 , indeed \P = not only for 3 = but also in 
the case g = 1. Therefore 

fc fc (2k~q)A(2k~q') gAqAq' 

<iEE E £e S ' + *° + * (56) 



(EZ) 2 - Z 2 



Lemma 3.3 The maximum of the function (02 + $2 + ^)(q,q' ',9,95) on the parameter 
region V with the additional constraint g > 2 is equal to 

-f(2/3)k(k - 1) + (2k - 2) Inn - 2 In (A; - 2)! + o(k) if h > h c 
-2(3hk - 2f((3)k 2 + (4k - 2) Inn - 41n(/c - 2)! + o(k) ifh<h c 
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The proof of this Lemma is analogous to that of Lemma 13.21 given in the Appendix [Cl 
With this Lemma we conclude the self averaging result. Consider first the case h > h c : 



varZ 



< 



cxp 



{ 



k In n + k - /(2/3) — — - - kink + o{k) 



+ 



-f(2p)k(k - 1) + {2k - 2) Inn - 21n(Jfc - 2)! + o(k)X < 



< exp | - 2k + 2k In k - 2 In n - 2(k - 2) ln( ) + o(k)j 



exp | — 2 In n + o{k) j = e 



21nn+o(fc) 



and using the asymptotic In ~ § we obtain the self averaging in this case. In the case 



h < h r the calculation is similar: 



varZ 
(EZ)2 



< exp | - 2 2k Inn + 2k - fihk 2 - f((3)k 2 -2k\nk + o(k) 
-2/3hk - 2f((3)k 2 + (4k - 2) Inn - 41n(ife - 2)! + o(Jfe)} 



<exp| -4/c + 4/cln/c-21nn-4(A:-2)ln( )+o(fc)} < e 

4 Phase transition across h c (T) 

By the previous results on the self averaging of Z with 

(fc 2 )} 



-2 In n+o(k) 





exp- 


[* 2 


H 








^ exp- 


/fc 2 



lnl/p _ /(2/3) 
c 2 



/(/?) - fh 



+ 



ii h > h c 
o(/fc 2 )} if h<h c 



(57) 



we can conclude that the line h c (T) represented in Figure 1 corresponds to a line of a 
first order phase transition. Indeed the function InZ turns out to be continuous with a 
discontinuous derivative in j3 when h = h c and we have — InZ = ^(-ff (c, r)) converges 



almost surely to 



9/9 

d_^ 2 (k 2 f'(2f3) + o(k 2 ) i£h>h c 
\k 2 (f'(p) + h) + o{k 2 ) i£h<h c 



(58) 



By the convexity property of the function InZ we can conclude with standard argu- 
nts that -§p lim^oo In Z = lim^^ In . 
evaluating directly the mean E fi2(H(a,r)) 



ments that A lim^oo In Z = lim^oo A In Z and so the same result can be obtained by 
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If we look at the model on the state space of couple of configurations, with Gibbs 
measure ^(a, r) = -^e _/3i ^ <T, ' r ), the two phases correspond to two different mean energies. 
As far as the second derivative is concerned we have 

2 { -k 2 f n (2(3) + o(k 2 ) iih>h c 

-^lnZ = var, 2 (H) = { _ _ c (59) 

d P 2 \-k 2 r((3) + o(k 2 ) ]£h<h c 

and again the same result can be obtained by evaluating directly the mean on the of 
the variance w.r.t. the pair measure Hi- 

In a similar way we can study the overlap q(a,r) by computing J^lnZ. Indeed 
V>2(q(p,r)) = k + j^j-hiZ; we obtain 

18 - [ if h > h c 
-'ZlnZl (60) 

P k dh [k if h<h c 
so that the two phases have not only different mean energies but also different mean 
overlap. 



5 A low temperature phase transition 

We prove in this section the last claim of our main theorem. The proof is divided in 
three steps. First, we made a few prelimiray remarks on the computation of the annealed 
partition function Z and we deduce an almost sure concentration property of the Gibbs 
measure /Lt2- Second, we translate this concentration property in a concentration property 
of the marginal law fx. Last, we evaluate the free entropy of the measure 7r CT for the typical 
configurations a by proving a last large deviation estimate. 

5.1 Concentration of the Gibbs measure // 2 

An alternative way to compute the annealed partition function consists in counting the 
mean number ftf(q, h, fo) of pairs of configurations (<r, r) with a given overlap q = \I\ := 
\af) r|, a given number l\ of missing links inside /, and a given number \<i of missing links 
between / and T := r\a, between S := a \ r and T, as well as S and /. We get 

2 = iliZ E e- mi+h+Hk - 9)] M(q,h,l2). (61) 

q=0 l 1= l 2 =0 
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To evaluate N{q,l\,l2) we use the following argument. Consider the obvious extension of 
the definition (|24"|) of Ho(cf,t) to a generic pair A, B of subsets of V: 

H (A,B)= Ji,jlA(i)l B (j). 
For a e {0, k} and I € {0, fco} let A := {A C a c : \A\ = a, and H (A, a) = I}, then 
E\A\ = (""*) (°* ) (1 " P) l P ak - 1 = e «M^-/,(^)]+ (fc 3 ) (62) 
where we denote by I p the large deviation functional 

: x G [0, 1] ^ xln— — + (1 - x) In — -. (63) 
1 — p p 

In Appendix [A] the main properties of this function are recalled; we just mention here 
that the function I p (x) is related to the function /(/?) used in section [2] by a Legendre 
transform, indeed we are doing the same computation of Z by using different variables. A 
similar computation can be given for subsets of a and so we can easily conclude that 

N(q, h,h) = n^q, l±)n s ,T(q, h) 

with 

M9 , (0 ^) exp{ -^ ( _^_ )} m 

ns^k) - ( 2( 7_;) ( 2 t7 ) ) -P{"(^ " ^(J^)} <«> 

and so 

with Q(q) defined in (|25l) . xi = ^pi) an d x 2 = The sums over Zi and can 

be written as sums over x\ and X2 and can be estimated with the saddle point method. 
We then obtain for Z the expression given in (I30p and (131 p by using the fact that / is 
the Legendre transform of —I. In the previous sections we used a different approach 
to estimate Z and Z because this decomposition becomes not easily dealt with as soon 
as second moment estimates are involved. However, the decomposition proposed here is 
useful to prove a concentration property for the Gibbs measure 
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Three simple remarks will be used in what follows. 

Remark 5.1 As far as equation i fglj) is concerned, we note that the set of values of density 
of missing links having positive entropy is given by 

X c :={x€[0,l]: l -^>I p (x)}. (67) 
Remark 5.2 Note that by the definition of cavity field §7ty we have 

J2 h i( S ) = ^HT) = H (S,T) 
so that H (S,T) = ^T^(S)+-£ ieS h t (T) 

Remark 5.3 The the density of missing links is typically constant in subsets of a given 
set. More precisely let A and B be a pair of subsets of V with \A\ = ak, \B\ = bk with 
a, b > and let p £ (0, 1). Then for every A\ C A with \Ai\ = a\k, a\ G (0, a) and for 
every p\ ^ p there exists 5(p±) > such that 

W(H (Ai,B) = p iai bk 2 \H (A,B) = pabk 2 ) < e ' S{pl)abk2 

The proof is an immediate consequence of the convexity of I p if we note that 

¥{H (Ai,B) = p iai bk 2 \H (A,B) = pabk 2 ) = 

= F(H (A U B) = Piai bk 2 , H (A 2 ,B) = p 2 (a - ai )bk 2 )e abk2l ^ +0 ^ < 

< e -a 1 bk 2 I p {p 1 )-{a-a 1 )bk 2 I v {p 2 )+abk 2 I v {p)+o{k 2 ) 

with A 2 := A\Ai and p 2 = p^ - pi^. 
For (a, r) in 

X k l) x x i?\ let us denote again the overlap by q(cr, r) and by Hq{g,t) 
the number of missing links between a and r. We define also 

Q(a, T):={i£T: h^S) £X c }U{ieS: K{T) g* X c } (68) 

that is the set of points in SL) T with non-typical number of missing links to the other set, 
and let q(a, r) := \Q(o-, t)\. Even if the energy H(a, r) does not depend on this parameter 
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q{a,r), as we will show in Section [5.31 the value of q(a,r) is crucial to perform entropy 
estimates at low temperature when h < h c . We then set, for any 5 > 0, 



Qs 



[1 -5,1] i£h> he, 



[0,5] 



if h < hr- 



[f (2/3) - 5, /'(2/3) + 5] iih> hs, 
[f((3)-5,f((3)+5] i£h<~h- c . 



(69) 



(70) 



and 



Qs:-- 



(71) 



[0, 1] if h > he or h < he and T > T c , 

[1 - 25, 1] if h < h 5 and T < T c . 
With these intervals of parameters we can define a set of typical pairs of configuration: 



(72) 



where we define 
have 



2(k-q(a,T)) 



when q(a, t) = k. By the self-averaging property of Z we 



E 

(<x,r)eE|, 



-f3H(a,r) 



1 



E 



-PH{<t,t) 



(73) 



Now, using the previous decomposition (|61|) and the fact that I p is strictly convex and 
more precisely that I' p ' > 2, we get, with the saddle point method, and using the remarks 
oTTllOllOthat 



E^ 2 (S C 2>5 )] <e 



-CS 2 k 2 



(74) 



for k large enough and a suitable constant C. Indeed 



EM£ c 2i5 )]<^- 



E 



-f3[2h+l 2 +h(k-q 



Ze o(k) A^i 

q=0,...,k xiG[0,l] 2; 2 G[0,1] 



)] AA(g,/i,/ 2 )+ 



+ 1^) E E ' E 'e^^^^-^/^/^ + e- 2 ^^ 2 (75) 

q=0,...,k Xl &[0,l] x 2 e[0,l] 

where the sums E^' are with the restriction given by a £ (T,2 : s) c , i-e., Yl q =o k ^ s w hh 
the condition q ^ Qs and Ylx- are W1 th the condition xi is such that Ip(x\) + 2(3x\ > 
7(2/3) + C(5) in the case h > h c and in the case h < h c with the condition X2 is such 
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that I p (x 2 ) +I3x 2 > f(P) + C(5), with C(<5) > % mm xe [ 0jl] F p (x) > |-2. Moreover 
the last term e ~ 2Cs2k2 estimates the mean of the measure of the pairs (a, r) such that 
<? ^' r - > G Q#, ^H (a, r ) ^ %<5 hut 2 (fc-g(CT > T )) 2(5- This can be obtained only in the case 
h < h s and T < T c and in this regime we have that /'(/?) -X" c an d the main contribution 
to Z is given by pairs of disjoint sets S and T with l-ffo(S') ^) — f'W)\ < ^ f° r a suitable 
constant -B. By remarks 1 5 , 1 1 and 15 . 2 1 we can conclude that S (and T) can be decomposed 
into two disjoint parts S = Si U S%, with \Si\ > 5k with different density of missing link 
to T. The estimate then follows by remark \5. 31 

We conclude, with Markov inequality and Borel-Cantelli lemma, that, almost surely, 

M2(S§ )5 ) < e" CT2fc2 / 2 (76) 

for k large enough. 

5.2 Concentration of the marginal measure /i 

Starting from £2,5 we want to define a set of typical configurations in Ai^ with the 
property that a G £<$ implies that 7r CT is concentrated on the configurations r such that 
(cr,r) G £2,5- 

To give a precise definition of this set we can proceed as follows. For all a in Xi. 



and a , a and p in [0,1], define 

(77) 



K fc 2(fe — a) k l 

a (a,a,p) := s a (a,a, p) - f3(p + h(l - a)) (78) 



s a (a,a,p) := In 



0* := max{0o-(a,a,p) : a,a,p£ [0, 1]} (79) 

Since the number of possible values of a, a and p for which s a (a, a, p) can be non-negative 
and finite is only polynomial in k, we note that, for all positive 5 and large enough k, the 
quantity In Z a is such that 

k 2 (f>l < In Z a < k 2 <P* a + 5k 2 . (80) 
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We then set, for any 5 > 0, 



£<5 := jcr G : there exist a G Qs, a G Qs, p G such that (j) a (ct, ol, p) > 4>*a ~ ^ j 

(81) 

Note that for a given positive 5, for A; large enough and for all a in £|, 



x . ( {T 6 *<»> : 6 & e &, e n s} ) <_ (82 , 



this means that we have a concentration property of 7r CT implying that for configurations 
a G" £5 the measure is concentrated on values of (a, a, p) not in Q<5 x Qs x Now, 
due to (fTBj) we have, for a given 5 > and k large enough, 



= E E ^( t ) + E ^) E ^( r ) ^ "Pixi - e ^ fc2/2 )- 

(83) 

We conclude, using the concentration property of p, that, almost surely, 

pi^fj < e- cs2k2 / 3 (84) 

for k large enough. 

5.3 Conclusion 

To estimate the entropy, up to o(k 2 ) 

S:=- E ^{<j)hx{p{a))=\nZ- E |pn Z CT = In Z + /3/i(F), (85) 

where, for any a in A^, F(a) := — ^ In Z a is the free energy associated with ir a , it is enough 
to estimate for all a in £5. Indeed, Z is self- averaging and we estimated lnZ up to 
o(k), moreover we have a polynomial uniform upper bound on F (polynomial in k), and 
an exponential concentration on £5. This implies that almost surely, for any 5 > 0, 

\p(F) - p(F\X s )\ <e~ cs2k2 l\ (86) 

We will estimate F(a), i.e., lnZ a , uniformely on £5 in the following cases: 
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(A) h > h B , 

(B) h < h B , and C[fi) < 0, i.e., T < T s , 

(C) h < h B , and C(/3) > 0, i.e., T > T e . 

For any positive 8, by definition of £5 for large large enough we have the following 
estimate for \w.Z a , for all (j in Sj, 

max (j) a (a,a,p) < 4>* < —^lnZ a < <fi* + 8 < max <p a (a,a, p) + 25. 

(87) 

We estimate max ag g j a£Q s ,peH6 ^(01,01, p) in the three different cases. 
Case (A): Since s a (a,p) > and 

ln(l/p) 

max s a (a, a, p) < (88) 

we have 

- /3(1 + h)8 - /?/'(2/3) <^lnZ a < -/3/'(2/3) + (2 + p)8 + ( 89 ) 
by using that c goes to c, we conclude that, almost surely, 

In cases (B) and (C) we will need a concentration result on the numbers of sites i 
outside a such that hi(a) = j + hk, i.e., g^\ = (see JHJ). 

Lemma 5.4 Lei 

«/ c := {j G N : | G A c } (91) 



wt/i X c defined in ((FJ\). With probability 1, for any 5 > 0, if k is large enough then, for 



all a in for j £ J c we have: 



for j J c we have 



; (92) 



9j,i < e kS . (93) 



27 



Proof: The random variable gj i follows a binomial law with parameters n — k < n and 

QVl - p) V _j = e -* J *07*)-H>(k) , (94) 
so that the usual large deviation estimates give 



9j,i > exp I <5 + 
< exp < — k ( 5 + 



MVg) _ t ( 3 
p [k 



7 /j^ ln(l/p) 



+ o(l) J e fe ( 5 +MVp)/c-/p07fc)]+) I (95) 



and, if m(l/p)/c > I p (j/k), i.e., j > j c 

exp |jfe + MlM _ /p (£))}) < exp {- e fc (Mi/p)/c-/ P (i/fe)+ (i))} . (96) 



5j,i 



Since the number of configurations a is not larger than e fc2 1ii ( 1 /p)/ c ) we obtain our result 
with the Borel-Cantelli lemma. □ 
By Lemma 15.41 we can obtain the following results: 

In case (B) T < T 5 i.e., /'(/?) 6 [0, l]\Xg, there exists a constant a2 such that, almost 
surely, for all k large enough, 

max s a (a, a, p) < a2<5 (97) 
This immediately follows from (|93|) . 

In case (C) T > T s i.e., /'(/3) S Xg, there exists a constant 03 such that, almost 
surely, for all k large enough, 

max s a (a, a, p) < - J p (/'(/3)) + a 3 <5 (98) 

The proof of this entropy estimates can be found in Appendix[Dj It is absolutely standard 
but we give it not only for completeness but also to show that the point of view of the 
Fermi statistics is a useful tool. The main idea is that in the asymptotics k — > 00, due 
to the convexity property of I p , the entropy is essentially due to the sites i with cavity 
field hi(a) such that G (/'(/?) + h — 8, /'(/?) + h + 8) and the number of such sites is 
estimated by Lemma [57 
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With the entropy estimates (j97H and (I98p we can easily complete our proof. 
Case (B): By equations (|S7J) and ([97]) if a is in £5, almost surely, for all k large enough, 

- pf'{p) -pS-ph<^\aZ a < -P?{p) +135- 0h(l -5) + a 2 5 + 25. (99) 

We conclude that, almost surely, 

lim * =2*^-/(0) + /J/'G8). (100) 

fc->+oo fe z C 

Case (C): Again by equations flSTJ) and (|98p if <r is in £5, almost surely, for all large 
enough, 

i In Z a < -Pf{p) + (35- fh(l -5) + - I p (f((3)) + a 3 5 + 25. (101) 

k z c 

We conclude that, almost surely, 



lim s 2 Mi/p) _ _ MVrt + (/ , + m _ hm 

k z c c c 



where we used that / is the Legendre transform of —I p . The opposite estimate 

Km ^ < ^ 

fe^+oo k c 

is trivial. 



A The functions f(j3) and I p (x) 

We give here some inequalities for the function f{/3) := — In \p + (1 — p)e - ^] defined in the 
main theorem. 

This is a non negative concave function with /(0) = 0, lim^oo /(/?) = In 1/p = I p (0); 
from its concavity we immediately obtain the following estimates: 

m > IM V / > 1 

B(0) :=/(/?) + /(2/3)-/(3/3)>0 
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Figure 3: The function /(/3) 



since both the functions F(/3) := f(/3) — an d B(/3) are strictly increasing function 
vanishing at zero. Moreover we have: 

/(2/?) + /(3/3) > /(/?) + /(4/3) 

which is an immediate consequence of concavity, and 

/(/3) + /(3/3)-/(2/3)-^M>0 

since f(fi) + /(3/3) - /(2/3) - H&l = f(fi) - 1%®. + /(3/3) _ ZiM _ again positive 
by concavity. 

For p £ (0, 1) and x G [0, 1] consider now the binomial large deviation functional 
defined in ([63]) . I p (x) = xln + (1 — x) In This is a convex non negative function 
with minimum at x = 1 — p and I p (0) = lnl/p, / p (l) = In 1/(1 — p). By recalling the 
asymptotic behavior for the binomial coefficient: 

(J) ~ (2vr)- 1 / 2 [x :E (l - x) 1 " x ]~ L (x(l - x)^)- 1 / 2 

with x = j; (see for instance [1] pg.4) we immediately obtain 

Ly i _ p) l pL -l = e -LI p{: c) + o(L) 
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Figure 4: The function I p {x) for p = 2/3 

The functions f(f5) and I p (x) are related by a Legendre transform. Indeed we have 

I P (x) +(3x> ftf) 

where the equality holds only for x = /'(/?) . By evaluating the critical point of the function 
I p {x) + fix we have 

W = h (i-,Ki-x) = - < ' 



ans so the critical point is 



p + (1 — pje p 



and this is a minimum due to the convexity of I p (x). 
In particular we have 

I p n (x) = —^—>2 
x{l — X) 

B Proof of Lemma 13.11 

Indeed to prove (|50p we note that, the coefficients C r in 

<p = g5(g5 2 ~ 1} (2/(2/3) - /(4/3)) + i ^ 5r C r 

r=l 

defined in equations gU, (gSJ), (@SJ), gZJ, (03D, gS}, flUD, can be estimated 

by using the concavity of the function /(/?) so that < 2/(/3) - /(2/3) < /(/3) + /(2/3) - 
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/(3/3) =: B. Indeed by using the constraints (|35p and (I36p we can estimate the coefficient: 



C r <B(kAg) C 5 < B(g - g B ) 



so that 



* - .'/-.('/:, - J ) ( /(2/3) - f -^P) + \B((k A g) + sbXs - 55 ) = * 



C Proofs of Lemmas Q and Q 



Proof of Lemma 13.21 We look now for the maximum of the function 



6 2 + $ 2 + ^ = (Ak-q-q'-g) Inn-ln ((q- g)\(q' -g)l\ -21n((k- q- g)\(k-q' -g)l) +C 



+ $2 + 55(55 - 1) (7(2/3) - ^M) + ±B((k Ag) + g 5 )(g - g 5 ) (103) 

By noting the symmetry of this function in the parameters q and q' and the fact that the 
constraints are in the form g < (2k — q) A (2k — q') and g§ < q A q' A g , we immediately can 
conclude that the maximum is obtained for q = q' ■ So we have only to study the function 
a(q,g,95) ~ b((li9,g<o) on the polytope V where 



a(q, g, g 5 ) = ~2ph(k -q)- 2f((3)(k 2 - q 2 ) - f(2(3)q(q - 1) + g 5 (g 5 - 1) (/(2/3) 



+ 



+ -B((k Ag)+ g 5 )(g - g 5 ) + (4k -2q-g)\nn 
b(q, g,g 5 )=2ln((q-gy)+Aln((k-q- 9 y))+C 



(104) 
(105) 



and V (see Figure 2) is defined by the relations: 



0<g<2k-q, 0<g 5 <gAq, 0<q<k 



(106) 



We first study the maximum of the function a on V. For g > k the hessian of a is given 
by 

/ [4/0?) - 2/(20)] \ 

is 

V \B [2/(/3) - /(4/3) — B] J 



(107) 
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and for g < k the hessian of a is given by 



[4/(/3) - 2/(2/3)] \ 

B \B 

\ \B [2/09) - /(4/3) - B] ) 



(108) 



Again by the concavity of the function /(/?) in both cases we have a positive eigenvalue 
Ai = 4/(/3) — 2/(2/3) and two real eigenvalues with A2 + A3 > if g < k and A2A3 < 
if g > k so at least two positive eigenvalues. We can conclude that the maximum of a 
is obtained on the edges of P. By studying the function a(x), with x = {q,g,g§), on 
all the edges we easily check that the maximum actually is obtained on the vertices. To 
this purpose we used the convexity relations of /(/3) listed in appendix El By a direct 
comparison we obtain that the maximum is obtained in the point x max = (k, 0, 0) for 
h > h c and in x max = (0,0,0) for h < h c as soon as /(2/3) - |/(4/3) < MjM. This 
inequality holds for all /3 when c G (1,2], while in the case c > 2 we can simply add the 
hypothesis (3 < (3 C to conclude. 

Fix now a £ (0, 1), in the region VC\ {g < k a } we have that a(x) — 6(x) is a decreasing 
function of g at q, g§ fixed and large k, and on the surface g = 55 again is a decreasing 
function of g for large fc. On the other hand we have for x G P Pi {g > k a } that a(x) < 
a(x max ) — b(x max ), so that, as in the discussion of Z, by noting that the function b is 
non-negative, we can conclude that the points x. max correspond to maximal values for the 
function a(x) — 6(x). 



D Proof of equation ( 1981) 

We have to estimate 

N(a,a,p) := \{r G : q(a,r) = ka, H (a,r) = k 2 p}\, (109) 

for a G [0,5] and p G [/'(/3) - 6,f(J3) + 5] with f'(J3) G X c . We have H (a,r) = 
H (a, I) + H (a, T) and so we get N(a, a, p) = J2 P 'e[ P -8,p+28] N i( a > a > p') n 2^, a, p') with 

Ni(a,a,p') = \{A G V\a : \A\ = (1 - a)k, H (a,A) = k 2 (l -a)p'}\, 



33 



N 2 (a,a,p') = \{A G a : 141 = ak, H Q (a, A) = k 2 a P — ^}|. 

a 

The term N2 is easily estimated from above by 2 k = e°( fc2 ). As far as the term iVi is 
concerned we can use the notation of the Fermi statistics and in particular (|15p . to write 

9j,i 



*i(W) = n(„:J (110) 

"j,l3=k 2 (l—a)p' 

By using the Stirling formula we can approximate the binomial coefficient as follows: 

9) = e -ge(* )+o (fc ») (ni) 

n I 



with 



obtaining: 



£{x) := xlnx + (1 — x) ln(l — x) 



E II ( 9i ' 1 ) = explmax^-^O^)] + o(A; 2 )} 



{™j,i}j=o,i,-.-,fe : i i 

Ej "j,l = ( 1 -Q) fe , 



with x = (xj)j g { ,i,...,fc}; where Xj := and the maximum is under the constraints 

• gj,ixj = (1 — a)k and ^ ■ gj,iXjj = k 2 (l — a)p' . With the Lagrange multiplier method 
and standard computation, we can evaluate this maximum by looking at the maximum of 
the function 

F( X ,X,p) = Y,9 3 ,i[- - (X + pj)xj] (112) 

3 

which is reached in x with aij = 1+e I +Mj with A and p solution of the equations 

^9j,\Xj = (1 - a)k and ^ g jt ixjj = k 2 (l - a)p'. (113) 

3 3 

In this maximum x we have 

Y^[-9j, l( f (%)] = A ( X - a ) k + K 1 - a ) k2 p' + °( fc2 )- ( 114 ) 
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By Lemma 15.41 we have that for j S J c , Xj must be exponentially small in k and for any 
5 > we have 

jeJc jeJc 

Due to the fact that /'(/?) £ X c , this sum is not exponentially small, i.e., 

fc Jnl/p _ _ x _ kji 3_ = Q{5k) 

for some j E J c , and so we can conclude that 

max&[^^ - -\-kp J - = k^-^- - f(n)] - A = 0(<5fc) 

that is A = fc[^±& - /(p)] + 0(5fc) and so, by (HUD that 

Ai(<t, a, p') < exp - /(p)] + 0(flfe)}(l - a)fc + p(l - a)fcY} = 

= exp{^(l - a)[^ - /(p) + up'] + 0(6k 2 )} 
c 

By recalling that p' e [p — (J, p + 25] = [/'(/?) — 25, /'(/?) + 3(5] and the Legendre transfor- 
mation between / and I p implying that p/'(/3) = /(p) — I P (/'(/?)) the proof of ([97|) and 
(198|) follows straightforward. 
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